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ONE- VARIABLE WORD EQUATIONS IN LINEAR TIME 

ARTUR JEZ 

Abstract. In this paper we consider word equations with one variable (and arbitrary many appear- 
ances of it). A recent technique of recompression, which is applicable to general word equations, is 
shown to be suitable also in this case. While in general case it is non-deterministic, it determinises 
in case of one variable and the obtained running time is 0{n + #xlogn), where #x is the number 
of appearances of the variable in the equation. This matches the previously-best algorithm due to 
Dqbrowski and Plandowski. Then, using a couple of heuristics as well as more detailed time analysis 
the running time is lowered to 0{n) in RAM model. Unfortunately no new properties of solutions are 
shown. 



1. Introduction 

1.1. Word equations. The problem of satisfiability of word equations was considered as one of 
the most intriguing in computer science and its study was initiated by Markow already in the '50. 
The first algorithm for it was given by Makanin [13], despite earlier conjectures that the problem 
is in fact undecidable. The proposed solution was very complicated, both in terms of proof-length, 
algorithm and computational complexity. It was improved several times, however, no essentially 
different approach was proposed for over two decades. 

An alternative algorithm was proposed by Plandowski and Rytter |19] , who presented a very simple 
algorithm, which in essence guessed a compressed representation of a solution. The running time of 
this nondeterministic algorithm was polynomial in n and log N, where N is the length of the length- 
minimal solution. However, at that time the only bound on such length followed from Makanin's work 
(with further improvements) and it was triply exponential in n. 

Soon after Plandowski showed, using novel factorisations, that N is at most doubly exponential |16] . 
showing that satisfiability of word equations is in NEXPTIME. Exploiting the interplay between fac- 
torisations and compression he improved the algorithm so that it worked in PSPACE |17] . 

Producing a description of all solutions of a word equation, even when a procedure for verification 
of its satisfiability is known, proved to be also of non-trivial task. Still, it is also possible to do this in 
PSPACE |18], though insight and non-trivial modifications to the earlier procedure are needed. 

On the other hand, it is only known that the satisfiability of word equations is NP-hard. 

1.1.1. Two variables. Since in general the problem is outside P, it was investigated, whether some 
subclass is feasible, with a restriction on the number of variables being a natural candidate. It was 
shown by Charatonik and Pacholski [5] that indeed, when only two variables are allowed (though with 
arbitrarily many appearances), the satisfiability can be verified in deterministic polynomial time. The 
degree of the polynomial was very hight, though. This was improved over the years and the best 
known algorithm is by D^browski and Plandowski [H] and it runs in 0{n^) and returns a description 
of all solutions. 

1.1.2. One variable. Clearly, the case of equations with only one variable is in P. Constructing a cubic 
algorithm is almost trivial, small improvements are needed to guarantee a quadratic running time. 
First non-trivial bound was given by Obono, Goralcik and Maksimenko, who devised an 0{nlogn) 
algorithm [15]. This was improved by D^browksi and Plandowski [1] to 0{n + ^x^ogn), where 

is the number of appearances of the variable in the equation. Furthermore they showed that there 
are at most 0(logn) distinct solutions and at most one infinite family of solutions. Intuitively, the 
0{#x log n) summand in the running time comes from the time needed to find and test these ©(log n) 
solutions. 
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This work was not completely model- independent, as it assumed that the alphabet S is finite or that 
it can be identified with numbers. A more general solution was presented by Laine and Plandowski |TT| , 
who improved the bound on the number of solutions to ©(log (plus the infinite family) and gave 
an 0{nlog^x) algorithm that runs in a pointer machine model (i.e. letters can be only compared 
and no arithmetical operations on them are allowed); roughly one candidate for the solution is found 
and tested in linear time. 

1.2. Recompression. Recently, the author proposed a new technique of recompression based on pre- 
vious techniques of Mehlhorn et. alfI3] (for dynamic text equality testing), Lohrey and Mathissen 
(for fully compressed membership problem for NFAs) and Sakamoto |20j (for construction of the 
smallest grammar for the input text). This method was successfully applied to various problems re- 
lated to grammar-compressed strings El IE] • Unexpectedly, this approach was also applicable to 
word equations, in which case alternative proofs of many known results were obtained using a unified 
approach 

The technique is based on iterative application of two replacement schemes performed on the text 

t: 

pair compression of ab: For two different letters a, b such that substring ab appears in t replace 

each of ab in i by a fresh letter c. 
a's block compression: For each maximal block o^, where a is a letter and £ > 1, that appears 

in t, replace all a^s in i by a fresh letter ag. 

In one phase, pair compression (block compression) is applied to all pairs (blocks, respectively) that 
appeared at the beginning of this phase. Ideally, each letter is then compressed and so the length of 
t halves, in a worst-case scenario during one phase t is still shortened by a constant factor. 

The surprising property is that such a schema can be efficiently applied even to grammar-compressed 
data [5l l6l or to text given in an implicit way, i.e. as a solution of a word equation [7] . In order to do so, 
local changes of the variables (or nonterminals) are needed: X is replaced with a^X (or Xa^), where 

is prefix (suffix, respectively) of the substitution for X. In this way the a solution that substitutes 
a^w for X is implicitly replaced with one that substitutes w. 

1.2.1. Recompression and one-variable equations. Clearly, as the recompression approach works for 
general word equations, it can be applied also to restricted subclasses. However, while in case of word 
equations it heavily relies on the nondeterminism, when restricted to instances with one variable it 
can be easily determinised. Furthermore, a fairly natural implementation has 0{n + klogn) running 
time, so the same as the D^browski and Plandowski algorithm [4j; this is presented in Section |3j 
Furthermore adding a few heuristics ,data structures as well as applying a more sophisticated analysis 
yields a linear running time, this is described in Section [4} 

1.3. Outline of the algorithm. In this paper we present an algorithm for one-variable equation 
based on the recompression. It also provides a compact description of all solutions of such an equation. 
Intuitively: when pair compression is applied, say ab is replaced by c (assuming it can be applied) 
then there is a one-to-one correspondence of the solutions before and after the compression, this 
correspondence is simply exchange of all abs by cs and vice-versa. The same applies to the block 
compression. On the other hand, the modification of X can lead to loss of solutions (note that for 
technical reasons we do note consider the solution S{X) = e): when X is to be replaced with a^X the 
new equation has corresponding solutions for S other than S{X) = a^ . So before the replacement, 
it is tested, whether S{X) = is a solution and if so, it is reported. The test itself is simple: both 
sides of the equation are read and their values under substitution S{X) = are created on the fiy 
and compared symbol by symbol, until a mismatch is found or both strings end. 

It is easy to implement the recompression so that one phase takes linear time. Then the cost can 
be distributed to explicit words between the variables, each of them is charged proportionally to its 
length. Consider such a string w, if it is long enough, its length decreases by a constant factor in 
one phase, see Lemma |8j Thus, the cost of compressing this fragment and testing a solution can be 
charged to the lost length. However, this is not true when w is short and the ij^x log n summand in 
the running time comes from bounding the running time for such 'short' strings. 

In Section |4] it is shown that using a couple of heuristics as well as more involved analysis the 
running time can be lowered to 0{n). The mentioned heuristics are as follows: 
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• The problematic 'short' words between the variables need to be substrings of the long words, 
this allows smaller storage size and consequently faster compression. 

• when we compare XwiXw2 ■ ■ ■ WmX from one side of the equation with its copy appearing on 
the other side, we make such a comparison in 0{1) time (using suffix arrays); 

• (S'(X)n)™' and (S'(X)n')™' (perhaps offsetted) are compared in 0{\u\ + \u'\) time instead of 
naive 0{'m ■\u\ + m - |n|), using simple facts from combinatorics on words. 

Furthermore a more insightful analysis shows that problematic 'short' words in the equation invalidate 
several candidate solutions fast. This allows a tighter estimation of the time spent on testing the 
solutions. 

A note on model. In order to perform the recompression efficiently, some algorithm for grouping pairs 
is needed. When we can identify the symbols in S with consecutive numbers, the grouping can be 
done using RadixSort in linear time. Thus, all (efficient) applications of recompression technique make 
such an assumption. On the other hand, the second of the mentioned heuristics craves checking string 
equality in costant time, to this end a suffix array [U] plus a structure for answering longest common 
prefix query (Icp) is employed [lU] on which we use range minimum queries pQ. The last structure 
needs the flexibility of the RAM model to run in 0(1) time per query. 

2. Preliminaries 

2.1. One- variable equations. Consider a word equation A = B over one variable X. Then without 
loss of generality one of A and B begin with a variable and the other with a letter: 

• if they both begin with the same symbol (be it letter or nonterminal), we can remove this 
symbol from them, without affecting the set of solutions, 

• if they begin with different letters, this equation clearly has no solution. 

The same applies to the last symbols of U and V. Thus, in the following we assume that the equation 
is of the form 

(1) AqXAi . . . An_^-lXAn^ = XBi . . . Bng-lXBn^ , 

where Ai,Bi G T,* and (n-is) denote the number of X appearances in A {B, respectively). Note that 
exactly one of ^n^j -B„e is empty and Aq is non-empty. If this condition is violated for any reason, 
we greedily repair by cutting letters from appropriate strings. We say that Aq is the first word of the 
equation and the non-empty of A^j^ and B^^ is the last word. 

A substitution S assigns a string to X, we expand it to {X U S)* with an obvious meaning. A 
solution is a substitution such that S{A) = S{B). For a given equation ^ = ;B we are looking for a 
description of all its solutions. We treat the empty solution S{X) = e in a special way and always 
assume that S[X) ^ e. 

Note that if S{X) ^ e, then using ([T| we can always determine the first (a) and last (6) letter of 
S{X) in 0{\) time. In fact, we can determine the length of the o-prefix and 6-suffix of S{X). 

Lemma 1. If Aq G a* then S{X) € a* for each solution S of A = B. 

If the first letter of Aq is a and Aq ^ a* then there is at most one solution S{X) € a*, existence 
of such a solution can he tested (and its length returned) in 0(1^1 -|- \B\) time. Furthermore, for 
S{X) ^ a* the lengths of the a-prefixes of S{X) and Aq are the same. 

Proof. Consider the case when G and suppose that S{X) ^ a*, let ^ > be the length of the 
a-prefix of S{X). The length of the a-prefix of S{A) is then |^o| + ^ > ^) which is the length of the 
a-prefix of S{B), contradiction. Hence S{X) £ a*. 

Consider now the case when begins with a but Aq ^ a* , let its a-prefix has length Ia- Consider 
S{X) G a"*", say S{X) = a^. Let the first letter other than a in ;B be the £b -|- 1 letter in B and let 
it be in block Bi. If there is no such Bi then there is no solution S{X) £ a"*", as then S{B) consists 
only of as, which is not tue for S{A). The length of the a-prefix of «S'(^) is Ia, while the length of the 
a-prefix of S{B) is Ib + i ' ^- Hence i = and this is the only candidate for the solution. 

It is easy to verify whether S{X) = a^ is a solution for a single ^ in linear time. It is enough to 
compare S{A) and S{B) letter by letter, note that thy can be created on the fly while reading A and 
B. Each such comparison consumes one symbol from A and B (note that if we compare a suffix of 
S{X), i.e. some for / < with S{X) = we simply remove from both those strings), so the 
running time is linear. 
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Lastly, consider S{X) ^ a*. Then the a-prefix of S{A) has length £a and as S{X) ^ a+, the a-prefix 
of S{B) is the same as the o-prefix of S{X), which consequently has length £a. □ 

Symmetric version of Lemma [T] holds for for the suffix of S{X). 

By TestSimpleSolution(a) we denote a procedure, described in Lemma[T] that for ^ cl* establishes 
the unique possible solution S{X) = a^, tests it and returns £ if this indeed is a solution. 

2.2. Representation of solutions. Consider any solution S A = B. We claim that S{X) is 
uniquely determined by its length and so when describing solution of „4 = ;S it is enough to give their 
lengths. If |5'(X)| < |j4o| then S{X) is a prefix of Aq, so S{X) is uniquely determined by its length. 
When \S{X)\ > \Aq\ then S{A) begins with AoS{X) while S{B) begins with S{X) and thus S{X) has 
a period Aq. Consequently, it is of the form AqA, where A is a prefix of ^o- So also in this case S{X) 
is uniquely determined by its length. 

Each letter in the current instance of our algorithm represents some string (in a compressed form) 
of the input equation, we store its weight which is the length of such a string. Furthermore, when we 
replace X with a^X (or Xa^) we keep track of the weight of a^. In this way, for each solution of the 
current equation we know what is the length of the corresponding solution of the original equation. 

2.3. Recompression. We recall here the technique of recompression [HEIIT], restating all important 
facts about it. Note that in case of one variable many notions simplify. 

2.3.1. Preserving solutions. All subprocedures of the presented algorithm should preserve solutions, 
i.e. there should be a one-to-one correspondence between solution before and after the application of 
the subprocedure. However, as some of the replace X with a^X (or Xb"^), some solutions may be lost 
in the process and so they should be reported. We formalise these notions. 

We say that a subprocedure preserves solutions when given an equation A = B it returns A' = B' 
such that for some strings u and v (calculated by the subprocedure) 

• some solutions of ^ = ;B are reported by the subprocedure, 

• for each unreported solution S of A = B there is a solution S' of A' = B' , where S{X) = 
uS'{X)v and S'{A') = uS{A)v 

• for each solution S' of A' = B' the S{X) = uS'{X)v is an unreported solution of ^ = i3. 

By PCab^c{w) we denote the string obtained from w by replacing each ab by c (we assume that 
a ^ b, so this is well-defined), this corresponds to pair compression. We say that a subprocedure 
properly implements pair compression for ab, if it satisfies the conditions for preserving solutions 
above, but with PCab^c{S{X)) = uS'{X)v and PCab^c{S{A)) = uS'{A')v replacing S{X) = uS'{X)v 
and A = uAv. Similarly, by JiCa{w) we denote a string with maximal blocks a^ replaced by and 
we say that a subprocedure properly implements blocks compression for a letter a. 

Given an equation A = B, its solution S and a pair ab £ appearing S{U) (or S{V)) we say 
that this appearance is explicit, if it comes from substring ab of U (or V, respectively); implicit, if it 
comes (wholly) from S{X); crossing otherwise. A pair is crossing if it has a crossing appearance and 
non-crossing otherwise. Similar notion applies to maximal blocks of as, in which case we say that a 
has a crossing block or it has no crossing blocks. Alternatively, a pair ab is crossing if b is the first 
letter of S{X) and aX appears in the equation or a is the last letter of S{X) and Xb appears in the 
equation or a is the last and b the first letter of S{X) and XX appears in the equation. 

Unless explicitly stated, we consider crossing/non-crossing pairs ab in which a ^ b. Note that as the 
first (last) letter of S{X) is the same for each S, the definition of the crossing pair does not depend 
on the solution; the same applies to crossing blocks. 

When a pair ab is non-crossing, its compression is easy, as it is enough to replace each explicit ab 
with a fresh letter c 



Algorithm 1 PairCompNCr(a, 6) Pair compression for a non-crossing pair 

1: let c G r be an unused letter 

2: replace each explicit ab in A and i3 by c 



Similarly when none block of a has a crossing appearance, the a's blocks compression consists simply 
of replacing explicit a blocks. 
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Algorithm 2 BlockCompNCr(a) Block compression for a letter a with no crossing block 

1: for each explicit a's £-block appearing \nU oi V do 
2: let G r be an unused letter 

3: replace every explicit a's £-block appearing in ^ or ;B by ai 



Lemma 2. Let ah he a non-crossing pair then PairCompNCr(a, 6) properly implements the pair com- 
pression for ah. 

Let a has no crossing hlocks, then BlockCompNCr(a) properly implements the hlock compression for 

a. 

Proof Consider first the case of PairCompNCr. Suppose that A = B has a solution S. Define S': 
S'{X) is equal to S{X) with each ah replaced with c (where c is a new letter). Consider S{A) 
and S'{A'). Then S'{A') is obtained from «S'(^) by replacing each ah: the explicit appearances of 
ah are replaced by PairCompNCr(a, 5), the implicit ones are replaced by the definition of S' and by 
the assumption there are no crossing appearances. The same applies to S{B) and S'{B'). Hence 
S'[A!) = VCab-^cS[A) = VCab^cSiB) = S'{B'). The proof in the other direction is the same: for S' 
we define S such that S{X) is obtained form S{X) by replacing each c by ah. It can be easily shown 
that S{A) = S{B) for such an S, furthermore S'{A!) = VCab^cS{A) and S'{B') = VCab~,cS{B). 
The proof for the block compression follows in the same way. □ 

The main idea of the recompression method is the way it deals with the crossing pairs: imagine 
that ah is a crossing pair, this is because S{X) = hw and aX appears in ^ = ;B or S{X) = wa and hX 
appears in it (the remaining case, in which S{X) = awh and XX appears in the equation is treated 
in the same way). The cases are symmetric, so we deal only with the first one. To 'uncross' ab in 
this case it is enough to 'left-pop' h from X: replace each X in the equation with bX and implicitly 
change the solution to S[X) = w. 



Algorithm 3 Pop(a, h) 

1: if 6 is the first letter of S{X) then 

2: if TestSimpleSolution(6) returns 1 then 

3: report solution S{X) = h 

4: replace each X m A = Bhy hX 

5: if a is the last letter of S{X) then 

6: if TestSimpleSolution(a) returns 1 then 

7: report solution S{X) = a 

8: replace each X \n A = Bhy Xa 



S{X) = 6 is a solution 
> Implicitly change S{X) = hw to S{X) = w 
> S{X) = a is a solution 
t> Implicitly change S{X) = w'a to S{X) = w' 



Lemma 3. Pop(a, h) preserves solutions and after its application the pair ah is noncrossing. 

Proof. We show that a6 in ^ = ;B is noncrossing. Consider whether X was replaced by bX is line |4| 
If not, then the first letter of S{X) and S'{X) is not b, so ah cannot be crossing because of X prefix. 
Suppose that X was replaced with hX. Then the letter to the left of X is not a, and so ab cannot be 
crossing because of X prefix. 

A similar analysis is applied to the right-end of X, which yields that ab cannot be a crossing pair. 

Observe that each solution reported by Pop is verified, so it is indeed a solution. Furthermore, 
as only one-letter solutions are returned, they correspond to e solutions of A' = B', which are not 
considered. 

Consider first the modification performed by in the if-statement from line[T} If S{X) does not begin 
with h (recall that all solutions have the same first letter) then nothing changes and the set of solutions 
of preserved. If S{X) = bw and additionally 
w = e: then it is reported in line |3j 

w ^ e: then S'{X) = w is a solution of the obtained equation. 
The same analysis for the if-statement from line [5] yields the claim of the lemma. □ 

Now the presented procedures can be merged into one procedure that turns crossing pairs into 
noncrossing ones and then compresses them, effectively compressing crossing pairs. 
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Algorithm 4 PairComp(a, b) Turning crossing pair ab into non-crossing ones and compressing it 

1: run Pop(a, b) 

2: run PairCompNCr(a, 6) 



Lemma 4. PairComp(a, 6) properly implements the pair compression of the pair ab. 
The proof fohows by combining Lemma [2] and |3] 

There is one issue: the number of non-crossing pairs can be large, however, a simple preprocessing, 
which basically applies Pop, is enough to reduce the number of crossing pairs to 2. 



Algorithm 5 PreProc Ensures that there are at most 2 crossing pairs 

1: let a, b be the first and last letter of S{X) 
2: run Pop(a, b) 



Lemma 5. PreProc preserves solution and after its application there are at most two crossing pairs. 

Proof. It is enough to show that there are at most 2 crossing pairs, as the rest follows form Lemma |3j 
Let a and b be the first and last letters of S{X), and a', b' such letters after the application of PreProc. 
Then each X is proceeded with a and succeeded with b in A' = B'. So the only crossing pairs are aa' 
and b'b (note that this might be the same pair). □ 

The problems with crossing blocks can be solved in a similar fashion: a has a crossing block, if and 
only if aa is a crossing pair. So we 'left-pop' a from X until the first letter of S{X) is different than 
a, we do the same with the ending letter b. This can be alternatively seen as removing the whole 
a-prefix (6-suffix, respectively) from X: suppose that S{X) = a^wb^, where w does not start with a 
nor end with b. Then we replace each X by a^Xh^ implicitly changing the solution to S{X) = w, see 
Algorithm |6j 



Algorithm 6 CutPrefSuff Cutting prefixes and suffixes 

1: let a be the first letter of S{X) 
2: report solution found by TestSimpleSolution(a) 
3: let £ > be the length of the a-prefix of S{X) 
4: replace each X in ^ = ;B by a^X 
5: 

6: let b be the last letter of S{X) 
7: report solution found by TestSimpleSolution(6) 
8: let r > be the length of the 6-suffix of S{X) 
9: replace each X m A = Bhy XU' 

10: 



i> is stored in a compressed form, 
t> implicitly change S{X) = a^w to S{X) = w 



> is stored in a compressed form, 
implicitly change SiX) = wb^ to S(X) = w 



Lemma 6. CutPrefSuff preserves solutions and after its application there are no crossing blocks of 
letters. 

Proof. Consider first only the changes done by the modification of the prefix. Suppose that S{X) = 
a^w, where w does not begin with a. If vu = e then this solution is reported in line [2] Otherwise, the 
S'{X) = w is the solution of the new equation. Similarly, for any solution S'{X) = w the S{X) = a^w 
is the solution of the original equation. 

The same analysis can be applied to the modifications of the suffix. 

Lastly, suppose that some letter c has a crossing block, without loss of generality assume that c is 
the first letter of S{X) and cX appears in the equation. But this is not possible: X was replaced by 
a^X and so the only letter to the left of X is a and S{X) does not start with a, contradiction. □ 

The CutPrefSuff allows defining a procedure BlockComp that compresses maximal blocks of all 
letters, regardless of whether they have crossing blocks or not. 
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Algorithm 7 BlockComp Compressing blocks of a 


1: Letters ^ letters appearing in the equation 




2: run CutPrefSuff 


> Removes crossing blocks of a 


3: for each letter a £ Letters do 




4: BlockCompNCr(a) 





Lemma 7. BlockComp properly implements the block compression for letters present in A = B before 
its application. 

The proof follows by combining Lemma [2] and |6j 

3. Main algorithm 

The following algorithm OneVarWordEq is basically a simplification of the general algorithm for 
testing the satisfiability of word equations [7] . 



Algorithm 8 OneVarWordEq Reports solutions of a given one-variable word equation 
1: while \Ao\ > 1 do 

2: BlockComp Compress blocks, in 0(1^1 + \B\) time. 

3: PreProc > There are only two crossing pairs, see Lemma [5] 

4: Crossing ^ list of crossing pairs > There are two such pairs 

5: Non-Crossing ^ list of non-crossing pairs 

6: for each ah £ Non-Crossing do t> Compress non-crossing pairs, in time 0(1^1 -|- 

7: PairCompNCr(a, 6) 

8: for ah £ Crossing do > Compress the 2 crossing pairs, in time 0(1^1 -|- \B)\ 

9: PairComp(a, 6) 

10: TestSolution > Test solutions from a*, see Lemma [9] 



We call one iteration of the main loop a phase. 

Theorem 1. OneVarWordEq runs in time 0(1^1 -|- \B\ + (ra^^-l-ng) log(|^| -|- \B\)) and correctly reports 
all solution of a word equation A = B. 

Before showing the running time, let us first comment on how the equation is stored. Each of sides 
{A and B) is represented as a table of pointers to strings, i.e. to Aq, Ai, . . . , and Bq, Bi, . . . , 
Bfig. Each of those words is stored as a doubly- linked list. When we want to refer to a concrete word 
in a phase, we use names Ai and Bj, when we want to stress its evolution in phases, we use names A 
i-word and B j-word. 

The most important property of OneVarWordEq is that the explicit strings between the variables 
shorten (assuming that they have a large enough length). 

We say that a word Ai (Bi) is short if it consists of at most 100 letters and long otherwise. To 
avoid usage of strange constants and its multiplicities, we shall use = 100 to denote this value. 

Lemma 8. Consider the length of the A i-word (or B j-word). If it is long then its length is reduced 
by 1/4 in this phase. If it is short then after the phase it still is. The length of each unreported solution 
is reduced by at least 1 /4 in a phase. 

Additionally, if the first (last) word is short then its length is shortened by at least 1 in a phase. 

Proof. We shall first deal with the words and then comment how this argument extends to the solutions. 
Consider two consecutive letters a, b in any word at the beginning of a phase. We show that at least 
one of those letters is compressed in this phase: 

a = b: Then they are compressed using BlockComp. 

a ^ b: If one of them is compressed by BlockComp then we are done, so suppose not. Then ab 
is a pair appearing in the equation and as such we try to compress it, either in line [7] or in 
line [9j This appearance cannot be compressed only when one of the letters a, b was already 
compressed, in some other pair. 
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This means that in a word of length k during the phase at least ^^^g"'*'^ letters are compressed (since 
we can associate with each uncompressed letter the two neighbouring compressed letters, moreover, 
one compressed letter is associated with only one uncompressed letter) i.e. its length is reduced by at 
least letters. 

On the other hand, letters are introduced into words by popping them from variables. Let symbol 
denote a single letter or block that is popped into a word, we investigate, how many symbols are 
introduced in this way in one phase. At most one symbol is popped to the left and one to the right 
by BlockComp in line[T| the same holds for PreProc in line [3] Moreover, one symbol is popped to the 
left and one to the right in line [7j since this line is executed twice, this yields 8 symbols in total. Note 
that the symbols popped by BlockComp are replaced by a single letter, so the claim in fact holds for 
letters and not symbols. 

So, consider any word Ai G T* (the proof for Bj is the same), at the beginning of the phase and 
let A[ be the corresponding word at the end of the phase. There were at most 8 symbols introduced 
into A[ (some of them might be compressed later). On the other hand, we already established that at 
least ^"^g""^ letters were removed due to compression. Hence 

L4 < Ai - + 8 < + 8- . 

It is easy to check that when Ai is short, i.e. \Ai\ < N = 100, then A'^ is short as well and when Ai is 
long, i.e. I^il > then \A'^\ < 

It is left to show that the first word shortens by at least one letter in each phase. Consider that if a 
letter a is left-popped from X then the same letter is removed from the front of the first word in order 
to preserve ([T]). Furthermore the right-popping does not affect the first word at all (as X is not to its 
right); the same applies to cutting the prefixes and suffixes. Hence the length of the first word is never 
increased by popping letters. Moreover, if at least one compression (be it block compression or pair 
compression) is performed inside the first word, its length drops. So consider the first word at the end 
of the phase let it be Aq. Note that there is no letter representing a compressed pair or block in Re- 
consider for the sake of contradiction the first such letter that appeared in the first word. It could not 
appear through a compression inside the first word (as we assumed that it did not happen), cutting 
prefixes does not introduce compressed letters, so it could only come through a left-pop from X. But 
then the first letter of Aq is the same, contradiction, as this was not the first compressed letter in Aq. 

So in Aq there are no compressed letters. Let ab be the first two letters in Aq. If a = 6 then they 
should have been compressed by blocks compression, contradiction. If a 7^ 6 then they were listed 
either in Non-Crossing or Crossing and so should have been compressed, contradiction. 

Now, consider a solution S{X). We know that S{X) is either a prefix of Aq or of the form AqA, 
where R is a prefix of Aq. In the first case, S{X) is compressed as a substring of a word. In the second 
observe that argument follows as long as we try to compress the pair between Aq and Aq. But since 
the first letter of S{X) and Aq is the same, this pair is one of the crossing pairs. So, the claim of the 
lemma holds for S{X) as well. □ 

The correctness of the algorithm follows from Lemmata [7] (for BlockComp), Lemma |5](for PreProc), 
Lemma [2] (for PairCompNCr), Lemma [4] (for PairComp) and from the lemma below, which deals with 
TestSolution. 

Lemma 9. For a we can report all solutions in which S{X) E for some natural £ in 0(|^| -|-|;S|) 
time. There is either exactly one i for which S{X) = is a solution, S{X) = is a solution for 
each i or there is no solution of this form. 

Proof. The construction and proof is similar as in Lemma [l| Suppose that S{X) = is a solution of 
A = B. We calculate the length of the a-prefix of S{A) and S{B). Consider first letter other than a in 
A, let it be in the A)^^ and suppose that there were £a letters a before it (if there is non such letter, 
imagine we attach an 'ending marker' to both A and B, which then becomes such letter). The clearly 
the length of the a-prefix of S{A) is kA - i + £a- Let additionally A' be obtained from A by removing 
those letters a and variables in between them. Similarly, define kB, £b and B'. Then the length of the 
o-prefix of S{B) is ks • £ + £3- 

The substitution S{X) = clearly is a solution if and only ii kA ■ £ + £a = ks • £ + £3 and 
iS'(^') = S{B'). Consider the number of natural solutions of the equation 

kA ■ X + £a = ks ■ X + £3 : 



ONE- VARIABLE WORD EQUATIONS IN LINEAR TIME 



9 



no natural solution: clearly there is no solution of the word equation A = B; 

one solution x = £: then S{X) = is the only possible solution among a* of ^ = B. To verify 

whether S satisfies A' = B' it is is enough to run TestSimpleSolution(a) on it and see whether 

it returns i. 

satisfied by all natural numbers: then the a-prefixes of A and B are of the same length for 
each S{X) £ a*. We thus repeat the procedure for A' = B' , shortening them so that they obey 
the form ([T|, if needed. Clearly, solutions in a* of A' = B' are exactly the solutions of ^ = ;S 
m a . 

The stopping condition for the recurrence above are obvious: if A' and B' are both empty then we 
are done (each S{X) = is a solution of this), if exactly one of them is empty and the other is not, 
there is no solution at all. 

Lastly, observe that the cost of the subprocedure above is proportional to the amount of read letters, 
which are not then read again, so the running time is 0(1-41 + \B\) □ 

Lemma 10. One phase of OneVarWordEq can be performed in 0{\A\ + \B\) time. 

Proof For grouping of pairs and blocks we use RadixSort, to this end it is needed that the alphabet 
of (used) letters can be identified with an interval of numbers, of size 0(|^| + \B\). In the first phase 
of OneVarWordEq this follows from the assumption on the input. In fact, this assumption can be 
weakened a little: it is enough to assume that at the if F C {1, 2, . . . , poly(|^| + |^|)}: in such case we 
can use RadixSort to sort T in time 0(|^| + \B\) and then replace T with set of consecutive natural 
numbers. At the end of this proof we describe how to bring back this property at the end of the phase. 

To perform BlockComp we want for each letter a appearing in the equation to have lists of all 
maximal a-blocks appearing in ^ = ;B (note that after CutPrefSuff there are no crossing blocks, see 
Lemma [g]). This is done by reading A = B and listing triples {a,k,p), where k is the length of a 
maximal block of os of length k and p is a pointer to the beginning of this appearance. Notice, that 
the maximal block of a's may consist also of prefixes/suffixes that were cut from X by CutPrefSuff. 
However, by Lemma[T]such a prefix is of length at most |j4o| < |.4| + \B\ (and similar analysis applies 
for the a suffix). Then each maximal block includes at most one such prefix and one such suffix thus 
the total length of the a maximal block is 3(|„4| + 1-61). Hence, the triples (a, k,p) can be sorted by their 
first two coordinates using RadixSort in total time 0(|w4| + \B\). We go through the list of maximal 
blocks. For a fixed letter a, we use the pointers to localise a's blocks in the rules and we replace each 
of its maximal block of length £ > 1 by a fresh letter. Since the blocks of a are sorted, all blocks of 
the same length are consecutive on the list, and replacing them by the same letter is easily done. 

To compress all non-crossing pairs, i.e. to perform the loop in line line [7] we do a similar thing as 
for blocks: we read both A and B, whenever we read a pair ab for a ^ b, we add a triple (a, b,p) to 
the temporary list, where p is a pointer to this position. Then we sort all these pairs according to 
lexicographic order on first two coordinates, we use RadixSort for that. Since in each phase we number 
the letters appearing in A = B using consecutive numbers, this can be done in time 0(|w4| + \B\). The 
appearances of the crossing pairs can be removed from the list: by Lemma [5] there are at most two 
crossing pairs and they can be easily established (by looking at ^o-'^^i)- So we read the sorted list of 
pairs appearances and we remove from it the ones that correspond to a crossing pair. Lastly, we go 
through this list and replaces pairs, as in the case of blocks. Note that when we try to replace ab it 
might be that this pair is no longer there as one of its letters was already replaced, in such a case we 
do nothing. 

We can compress each of the crossing pairs naively in 0(1^1 + \B\) time by simply first applying 
the popping and then reading the equation form the left to the right. 

It is left to describe, how to enumerate (with consecutive numbers) letters in F at the end of each 
phase. Firstly notice that we call easily enumerate all letters introduced in this phase and identify 
them (at the end of this phase) with {1, . . . ,m}, where m is the number of introduced letters. Next 
by the assumption the letters in F (from the beginning of this phase) are already identified with a 
subset of {1, . . . , 1^1 + we want to renumber them, so that the subset of letters from F that are 
present at the end of the phase is identified with {m + 1 , . . . , m + m'} for an appropriate m' . Note 
that each block of a'^ for k > 2 was replaced, so only letters with maximal block of length 1 may have 
remained. To identify them we can simply extend the algorithm for block compression: it should also 
make a separate list in which maximal blocks of length 1 are listed (with pointers to appearances), 
this list is also sorted according to the letter value. We then go through the list of those letters and 
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when dealing with a we use the pointers to the appearances of a to estabhsh, whether a is stih in the 
current equation A = B. If so, we assign a the next free number. Clearly the whole process takes at 
most 0(1^1 + 1^1) time. □ 

Lemma 11. As soon as first or last word becomes short, the rest of the running time of OneVarWordEq 
is 0{n). 



Proof This is fairly simple: one phase takes 0(1^1 + \B\) time by Lemma 10 (this is at most 0{n) by 
Lemma [s]) and as Lemma [s] guarantees both first word and last word are shortened by at least one 
letter, there will be at most N many phases. Lastly, Lemma |9] shows that TestSolution also runs in 
0{n). □ 

Lemma 12. The running time of OneVarWordEq till one of first or last word becomes short is 0{n + 
(n^ + nB)logn). 



Proof. By Lemma 10 the time of one iteration of OneVarWordEq is 0(1-4.1 + We distribute the 
cost among the A words and B words: we charge /3\Ai\ to A z-word (similarly to for B words), for 
some positive (3. Fix A i-word, we separately estimate how much was charged to it when it was a long 
and short word. 

long: Let Ui be the initial length of this word. Then these costs are at most + |/3nj + 1 /3nj + 
. . . < APm. 

short: Since A i-word is short, its length is at most A^, so we charge at most N/3 to it. Notice, 
that there are 0(logn) iterations of the loop in total, as first word is of length at most n and 
it shortens by | in each iteration when it is long and we calculate only the cost when it is 
long. Hence the cost charged in this way is at most 0(logn), summing over all words yields 
O {{n A + nt3) log n). □ 

4. Heuristics and Better Analysis 

The intuition gained from the analysis in the previous section is that the main obstacle in the linear 
running time is the necessity of dealing with short words, as the time spend on processing them is 
difficult to charge. This applies to both the compression performed within the short words, which 
does not guarantee any reduction in length, and to testing of the candidate solutions, which cannot 
be charged to the length decrease of the whole equation. 

The improvement to linear running time is done by four major modifications, which are described 
in details in the following subsections: 

several equations: Instead of a single equation, we store a system of several equations and look 
for a solution of such a system. This allows removal of some words from the equations and 
thus decreases the overall storing space and testing time, 
small solutions: We identify a class of particularly simple solutions, called small, and show 
that a solution is reported within 0(1) phases from the moment when it became small. In 
several problematic cases of the analysis we are able to show that the solutions involved are 
small and so it is easier to charge the time spent on testing them, 
storage: The storage is changed so that all words are represented by a structure of size propor- 
tional to the size of the long words. In this way the storage space decreases by a constant 
factor in each phase and so the running time (except for testing) is linear, 
testing: The testing procedure is modified, so that the time it spends on the short words is 
reduced. In particular, we improve the rough estimate that one TestSimpleSolution takes time 
proportional to the equation to an estimation that actually counts for each word whether it 
was included in the test or not. 

4.1. SufRx arrays and Icp arrays. A suffix array SA[l..m] for a string 7i;[l..m] stores the m 
non-trivial suffixes of w, that is w[m] , w[m — 1 . . m], . . . , w[l . . m] in (increasing) lexicographical order. 
In other words, SA[A;] = p if and only ii w[p . . .m] is the k-th suffix according to the lexicographical 
order. It is known that such an array can be constructed in 0(m) time [U] assuming that RadixSort is 
applicable to letters, i.e. that they are integers from {1,2, . . . , m^} for some constant c. 

Using a suffix array the equality testing for substrings of w reduces to the longest common prefix 
(Icp) query: observe that w[i . .i + k] = w[j . . j + k] if and only if the common prefix oi w[i . .m] and 
w[j . .m] is at least k. The ffist step in constructing a data structure for answering such queries is 
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the LCP array: for each z = 1, . . . , m — 1 the LCP[i] stores the length of the longest common prefix 
of SA[i] and SA[i + 1]. Given a suffix array, the LCP array can be constructed in linear time [10], 
however, the linear-time construction of suffix arrays can be in fact extended to return also the LCP 
array ^ 

When the LCP array is supplied, the general longest prefix queries reduce to the range mini- 
mum queries: the longest common prexif of SA[i] and ^^[j] (for i < j) is the minimum among 
LCP[i], . . . ,LCP[j — 1], and so it is enough to have a data structure that answers the queries about 
the minimum in the range in constant time. Such data structures in general case are known and in 
case of LCP arrays even simpler construction were given [T] . 

4.2. Several equations. The improved analysis assumes that we do not store a single equation, 
instead, we store several equations and look for substitutions that simultaneously satisfy all of them. 
Hence we have a collection Ai = Bi of equations, for i = 1, . . . , m, each of them is of the form described 
by ([T]) ; by ^ = ;B we denote the whole system of those equations. In particular, each of those equations 
specifies the first and last letter of the solution, length of the a-prefix and suffix etc., exactly in the 
same way as it does for a single equation. However, it is enough to use only one of them, say = Bi, 
as if there is any conflict then there is no solution at all. The consistency is not checked, simply when 
we find out about inconsistency, we terminate immediately. 

The system of equations stored by OneVarWordEq is obtained by replacing one equation A[A'{ = 
B'j^B'l with equivalent two equations A!^ = B[ and A!l = B'l (note that in general the latter two equation 
are not equivalent to the former one, however, we perform the replacement only when they are). Hence 
we store words using the same tables as before, simply for each word we additionally keep information 
whether it is a first/last word in some equation. The first (last) word of Ai has a link to the last word 
of Ai-i and Ai (the first word of Ai and Ai+i, respectively). We also say that Ai {Bj) is first or last if 
it is in any of the stored equations. The original equation implies a natural order on the equations in 
the system: among the two equations Ai = Bi and Aj = Bj the one whose words correspond to earlier 
words from the input equation is earlier. This order is followed whenever we perform any operations 
on all words of the equations. 

All operations on a single equation introduced in the previous sections (popping letters, cutting 
prefixes and suffixes, pair compression, blocks compression) generalises easily to a system of equations 
and they preserve their properties and running times, with the length of a single equation replaced by 
a sum of lengths of all equations. However, some lemmata require a little care: firstly, PreProc should 
ensure that there are only two crossing pairs. This is the case, as each X in every equation is replaced 
by the same aXb and S{X) is the same for all equations, which is the main fact used in the proof of 
Lemma [H 

Secondly, Lemma [8] ensured that in each phase the length of the first and last word is decreased. 
Currently the first words in each equation may be different, however, the analysis in Lemma |8] applies 
to each of them. 

4.3. Small solutions. We say that a word w represented a.s w = wiit^l'U^s (where I is arbitrary) is 
almost periodic^ with period size \w2\ and side size \wiW'i\ (note that several such representation may 
exist, we use this notion for a particular representation that is clear from the context). A substitution 
S is small, if S{X) = {w)^v, where w, v are almost periodic, with period size at most and side size 
at most 6A. 

Lemma 13. Suppose that S{X) is a small solution. There is a constant c such that within c phases 
the corresponding solution is reported by OneVarWordEq. 

Proof. Note that for sure a substitution is tested when it is reduced to one letter. So it is enough to 
show that a small solution is reduced to one letter within 0(1) phases. 

Intuition is as follows: consider an almost periodic word, represented as W1W2WS. Ideally, all 
compressions are done separately on wi, and each W2. In this way we obtain a string w'iW2w'^ and 
from Lemmajsjit follows that \w'^\ < ^\wi\. After ©(log 11^21) steps we obtain a word W1W2WS in which 
W2 is a single letter, and so in this phase wf is replaced with a single letter. Then, since the length of 
wi and W3 is at most 6A^, after 0(1) phases this is reduced to a single letter. Concerning the small 
solution, uw^v we first make such an analysis for w, when it is reduced to a single letter (after 0{1) 
phases) after one additional phase = is also reduced to one letter (by BlockComp) and so the 
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Figure 1. The alternative factorisation. The first compressed letters are in grey. For 
simplicity i = r = 1. Each z' between the cds is compressed independently. 

obtained string u'a^v' is a concatenation of three almost periodic strings. Using the same analysis as 
above or each of them we obtain that it takes 0(1) time to reduce them to a single letter. 

In reality we need to take into the account that some compression are made on the crossings of the 
considered strings, however, we can partition the strings so that the result is almost as in the idealised 
case. We begin with showing an appropriate claim in the borderline cases. 

Claim 1. Consider almost periodic substring uw^v of a solution S{X) with period size at most N 
and size side at most 6A^. If = 1 or it; is a block of letters, this is replaced with one symbol in 0(1) 
phases. 

Proof. If k = 1 then S{X) = uwv, where \uvw\ < 7 ■ N and by Lemma [s] in 0(1) this is reduced to a 
single-letter solution which is then reported. If w; is a block of the same letter then it is replaced with 
a single symbol in one phase, reducing to the previous case. □ 

We say that for a string uwv during one phase of OneVarWordEq the letters in w are compressed 
independently, if every compressed pair or block were either wholly within this w or wholly outside 
this w (in some sense this corresponds to the non-crossing compression). 

Claim 2. Consider almost periodic substring uw^v of a solution S{X) and suppose that w is not a 
block of one letter. Then there is a partition of this string into u'w'^~'^v' such that 

• \w'\ = \w\ (and consequently \u'\ + \v'\ = |m| -|- \v\ -\- 2\w\) 

• the form of w' depends solely on w and does not depend on u, v (it does depend on the equation 
and on the order of blocks and pairs compressed by OneVarWordEq) 

• the compression in one phase of OneVarWordEq compresses each w' from w'^~'^ independently. 

Proof. Let w = a^zV, where a, 6 S S, ^, r > 1 and z G S* does not start with a nor it ends with 
h. Let us represent uw^v as u{a^zU')^v and consider the first compression performed fullywithin the 
z}f{a^zh'^)^~'^a^z, clearly some compression is made wholly within this blocks, see Fig. Ill Suppose 
for simplicity, that the first to be performed is a compression of pair, say cd. The case, when this 
first compression is a block compression is done similarly. We claim that all pairs cd that appeared 
within this fragment zb'^{a^zb'^)^~'^a^z at the beginning are compressed at this moment. Assume for 
the sake of contradiction that this is not true. So this means that one of the letters, say c was already 
compressed in some other compression performed earlier. By the choice of the compressed pair, this 
c is compressed with a letter from outside of the fragment zh{a^z}f)^~'^az, there are two possibilities: 
c is the last letter of zV {a^ zh'^)^~'^ z: Observe that the letter succeeding c is either 6 or a 
letter representing a compressed pair /string. In the latter case we do not make a further 
compression, so it is b. This is a contradiction: each c that is a last letter of z was initially 
followed by 5, and so in fact this compression was performed earlier, contradiction with a 
choice of the compressed pair (as the first one fully within zb^{a^zb^)^~'^a^z). 
c is the first letter of zb^{a^zb'')^~'^a^z: The argument is symmetric, with a preceding c in this 
case. 

There are at least k — 1 such pairs, each of them separated by \w\ — 2 letters, i.e. the {cdz')^~'^cd is a 
substring of zV\a^zb''')^~'^a^z, for some z' of length \w\ — 2, see Fig. [l| We take w' = cdz' and let u' be 
the u concatenated with string proceeding the {cdz')^~'^cd and v' the v concatenated with the string 
following this {cdz')^~'^ (note that it includes the ending cd). Clearly \w'\ = \w\ and consequently 
l^'l + l^'l = |ii|-|-|'(;| + 2|tt;|. Note that each w' begins with cd, which is the first substring even partially 
within w' that is compressed, furthermore, each of those w' is also followed by cd. So the compression 
inside w' is done independently (because by the choice of cd there was no prior compression applied 
in w'). □ 

Consider a small solution, i.e. S{X) = {w)^v, where w, v are almost periodic with period size N and 
side size 6A^. We claim that after each phase of OneVarWordEq, the solution will be of more general 
form 
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Claim 3. Suppose that a solution S{X) is small. Then during the OneVarWordEq S'{X) = u' {w')^' v' , 
where 

• m' is a word 

• w' is almost periodic with its period size at most | of the period size (side size, repspectively) 
from the previous phase 

• f ' is a concatenation of almost periodic words, the sum of their period-sizes is at most 2>N 
while the sum of their side-sizes plus the length of u' is at most 24iV 

Proof. Clearly those conditions are met in the beginning. 

So consider uw^v at the beginning of one phase in the general case, in which we can apply Claim [2] 
to each w in uw^v. Let w = W1W2WZ. Then by this claim we can represent wiW2W^ as w'iW2~'^ w'^., 
where \w'2\ = \w2\ and compression of each w'2 is independent (also, \w'i \ -|- 1^3! = \ wi \ -|- l^s] -|- 2|i(;2|). 
Consider then uw^v which can be represented as 



w'2 ^"^3] V = Uw'i (w2 "^w'^w'i^ W2 ^w'^V 



Observe that each W2~'^w'^w'i is delimited by w'2 (it includes it in the left end and to the right there 
is a copy of it which is not inside this w'2~'^ w''^w'i) and each w'2 is compressed independently, so also 
each w'2~'^w'']^w'i is compressed independently, so in particular it is compressed in the same way. Thus 
uw^v is compressed into u'w'^~^v', let us estimate their sizes. 

The \ W2~ "U^s^'i ) is going to be compressed to some \ W2 ~ w^wlj , where \w'2 \ < ||^^^2l 
l^^i'U^sl < f I^Wsli hence both the period-size and side-size reduce by 1/4. 

Concerning u', it is obtained as a compression of uw'i, so it has length at most Concerning 
v' , note that after prepending w'2~'^w'^v it is again a concatenation of periodical words, however, the 
sum of period-size is increased to and the side-size increased by \w'^\. Now, using Claim |2] those 
almost-periodical word can be refactored, so that the compression within each of the periodic part 
is independent (note that w'2~'^ does not need to be refactored). In this way the periodic-size is 
unchanged and the side-size increases by at most 6A^ (i.e. twice the period-size). Afterwards, this is 
all compressed and so the lengths reduce by at least one fourth. So the new period size is at most 

^{N + 3,N) = 3iV , 

as claimed, while \u'\ plus the sum of side-sizes of v' is at most 

3 3 

'uw'i \ + - I 11x^31-1- "original side size of f " -|- 6A^ 



4' 4 
3 



additional side-size from the refactoringV 

(d^'il + I'^^sl) + (1^1 + "original side size of u") -|- 6A^) 



4 

= ^(2N + 24iV + 6iV) 
= 24A^ 

as claimed. 

We are left to consider the border cases, in which W2 is either a (non-trivial) block of letters or a 
single letter. Consider then the case in which W2 is a block of a single letter. Then during the block 
compression W2 is replaced with a single letter and the rest of the proof follows in a similar way. In 
particular, the claim holds in this case. When W2 is a single letter the whole W2 is replaced with a 
single letter, which we can include in w'l or w'^ with w'2 = e, then the claim holds as well. □ 

Using Claim [3] we obtain that after 0(1) phases the S{X) = w^v is reduced to S'{X) = u'v', where 
v' is a concatenation of almost periodic words such that the sum of their period-size is at most 3A^ and 
the sum of their side size plus \u'\ is at most 24 A^. Applying iteratively to them Claim [2] we obtain 
that after 0(1) phases this is replaced with a single letter. □ 

4.4. Storing of an equation. While the long words are stored exactly as they used to, the short 
words are stored more efficiently: we keep a table of short words and the pointer from the word-table 
points to the table of short words, in this way all identical short words are stored only once. We say 
that such a representation is succinct and its size is the sum of lengths of words stored in it. Note 
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that we do not include the size of the word table. Additionally, all long words are kept on a doubly 
linked list (recall that each of them stores whether it is the first or the last word). Note that in this 
way we do not need to actually read the whole equation in order to compress it: it is enough to read 
the words in the succinct representation. 

We first show that such a storage makes sense, i.e. that if two short words become equal, they 
remain equal in the following phases. 

Lemma 14. Consider any words A and B in the input equation. Suppose that during OneVarWordEq 
they were transformed to A' = B' , none of which is a first or last word in one of the equations. Then 
A = B if and only if A' = B' . 

Proof. By induction on operation performed by OneVarWordEq. Since none of the A', B' is the first 
or last word in the equation, it means that during the whole OneVarWordEq they had X to the left 
and to the right. So whenever a letter was left-popped or right-popped from X, it was prepended or 
appended to both A and B; the same applies to cutting prefixes and suffixes. Compression is never 
applied to a crossing pair or a crossing block, so after it two strings are equal if and only if they were 
before the operation. The removal of letters is applied only to first and last words, so it does not apply 
to words considered here. Partitioning the equation into subequations does not affect the equality of 
words. □ 

The compression (both pair and block) can be performed on succinct representation in linear time. 

Lemma 15. The compression in one phase of OneVarWordEq can be performed in time linear in size 
of the succinct representation. 

Proof. The long words are stored in a list and we can compress them without the need of reading 
the word table. We know which one of them is first or last, so when letters are popped from X we 
know what letters are appended/prepended to each of those words. Since they are stored explicitly, 
the claim for them follows from the analysis of the original version of OneVarWordEq. 



For the short words stored in the tables, from Lemma 14 it follows that if an explicit word A appears 
twice in the equations (both times not as a first, nor last word of the equation) it is changed during 
OneVarWordEq in the same way at both those instances. So it is enough to perform the operations on 
the words stored in the tables, doing so as in the original version of OneVarWordEq takes time linear 
in the size of the tables of short words. □ 

The words stored in the tables are of two types: normal and overdue. The normal words are 
substrings of the long words or Aq and consequently the sum of their sizes is proportional to the size 
of the long words. A word becomes overdue if at the beginning of the phase it is not a substring of a 
long word or Aq. It might be that it becomes a substring of such a word later, it does not stop to be 
an overdue word in such a case. The new overdue words can be identified in linear time: this is done 
by constructing a suffix array for a concatenation of long and short strings appearing in the equations. 

Lemma 16. In time proportional to the sum of sizes of the long words plus the number of overdue 
words we can identify the new overdue words. 

Furthermore in the same time bounds we can return for each overdue word a sorted list of its 
appearances. 

Proof. Consider all long words Aq, . . . , Am (with or without multiplicities, it does not matter) and 
all short (not already overdue) words A\, . . -A'^,, without multiplicities; in both cases this is just a 
listing of words stored in the representation (except for old overdue words). We construct a suffix 
array for the string 

Al$Ai$...AmU[$...A'^,#, 
which can be done in linear time [U]. 

Now A[ is a factor in some Aj (the case of Aq is similar, it is omitted to streamline the presentation) 
if and only if for some suffix A'^ of Aj the strings A'^$Aj+i . . . ^^$^1$ ■ • • and A[$ . . . 

have a common prefix of length at least \A[\. In terms of a suffix array, the entries for . . . $A'^,^ 
and Aj$Aj^i . . . . . • $A'^,# should have a common prefix of length at least \A'^\. Recall that 

the length of the longest common prefix of two suffixes stored at positions p < p' in the suffix array 
is the minimum of LCP[p\, LCP[p -|- 1], . . . , LCP[p']. For fixed suffix ^4'^$ . . . %A'^,jj^ consider the 
A'-%Ajj^i . . . %Am%A'i% ■ ■ ■ $A'^,^ with which it has the longest common prefix. Then it is either the 
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Figure 2. A is arranged against B. The periods of length at most \B\ — \A\ are in 
hgther grey. Since A^ at least one of the is non-empty. 



first previous or first next suffix of this form in the suffix array. Thus the appropriate computation can 
be done in linear time: we first go down in the suffix array, storing the last spotted entry corresponding 
to a suffix of some long Aj, calculating the LCP with consecutive suffixes and storing them for the 
suffixes of the form A'j% . . . %A'^,^. We then do the same going from the bottom of the suffix array. 
Lastly, we choose the larger from two stored values; for A^% . . . $A'^,# it is smaller than \A'-\ if and 
only if yl'j just became an overdue word. 

Concerning the running time, the size of the string is proportional to the length of the long words 
plus the length of the words that are still normal, i.e. not overdue (and this is proportional to the 
length of the long words) plus the size of the new overdue words. As each of the latter is of size at 
most N, the sum of their sizes is proportional to their number. □ 

The main property of the overdue words is that they can be removed from the equations in 0(1) 
phases after becoming overdue. This is shown by a serious of lemmata. 

First we need to define what does it mean that for solution word A in one side of the equation is at 
the same position as its copy on the other side of the equation: we say that for a substitution S the Ai 
(or its subword) is arranged against Bj {S{X) for some fixed appearance of X) if the position within 
S{A) occupied by this (subword of) Ai are within the positions occupied by Bj {S{X), respectively). 

Lemma 17. Consider a word A in a phase in which it becomes overdue. Then for each solution S{X) 
either S is small or in every S{Ai) = S{Bi) each word A is arranged against another instance of A. 

Proof. Consider an equation and a solution S such that in some S{Ai) = S{Bi) a factor coming from 
an overdue word A is not arranged against a factor coming from an overdue word A. There are several 
cases: 

A is arranged against S{X). Note that in this case A is a substring of S{X). Either S{X) is a 
substring of Aq or S{X) = ^q^q, where A'q is a prefix of ^o- In the former case ^ is a factor of 
which is a contradiction, in the latter it is a factor of ^q^^. As Aq is long and A short, it follows that 
|j4| < |j4o| and so yl is a factor of A^, contradiction with the assumption that A is overdue. 

A is arranged against some word. Since A is an overdue word, this means that it is arranged against 
short word B. Note that both A and B are preceded and succeeded by S{X)^ since A^ B we conclude 
that S{X) has a period at most \B\ — \ A\, see Fig. [2| in particular S is small. 

Other case. Some part of A is arranged against S[X) and as A is preceded and succeeded by -S'(X), 
which means that S{X) has a period at most \A\, similarly as in the previous case; so in particular S 
is small. □ 



Observe that due to Lemmata 13 and 17 the overdue words can be removed in 0{1) phases after 
their introduction: suppose that A becomes an overdue word in phase Any solution, in which an 
overdue word A is not arranged against another copy of A is small and so it is reported after 0(1) 
phases. In the following phase an equation A[XAXA'l = B'^XAXB'- , where Ai and Bi do not have 
A as a word, is equivalent to two equations A'i = B'i and A!l = B'l and this procedure can be applied 
recursively to A!l = B". In this way, all appearances of A are removed and no solutions are lost in the 
process. There may be many overdue strings so the process is a little more complicated, however, it 
can be implemented with ease in time proportional to the size of the removed overdue words and so 
this time summed over all phases is 0(n). 

Lemma 18. Consider the set of overdue words introduced in phase i. Then in phase £ + c (for some 
constant c) we can remove all those words from the equations. The obtained set of equations has the 
same set of solutions. The amortised time spend on removal of overdue words, over the whole run of 
OneVarWordEq, is 0(#x). 
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Proof. Consider any word A that become overdue in phase i and any solution S of this equation, such 
that in some S{Ai) = S{Bi) the exphcit word A is not arranged against another instance of the same 



exphcit word. Then due to Lemma 17 the S{X) is small. Consequently, from Lemma 13 this solution 
is reported before phase £ + c, for some constant c. So any solution S' in phase H. + c corresponds to 
a solution S from phase I that had each explicit A arranged in each S{Ai) = S{Bi) against another 
instance of the explicit A. Since all operations in a phase either transform solution, implement the 
pair compression of implement the blocks compression for a solution S{X), it follows that in phase 
I + c the corresponding overdue words A' are arranged against each other in S'{A!^) = S'{B'j). 

This observation allows removing all overdue words introduced in phase I. Let Ci, C2, . . . , Cm 
(in phase ^ + c) correspond to all overdue words introduced in phase For each overdue word A 
the list of pointers to its appearances in left-hand sides of the equations and right-hand sides of the 



equations are sorted according to the order of their appearances, see Lemma 16, We go through 
those lists, if the first appearances of A in the left-hand sides and right-hand sides are in different 
equations then the equations are not satisfiable, as this would contradict that in each solution both A 
is arranged against its copy. Otherwise, they are in the same equation Ai = Bi, which is of the form 
A[XAXA'- = B'j^XAXB'^, where A'^ and B[ do not have any appearance of A within them. We split 
it into two equations A[ = B[ and A!l = B'l and we trim them so that they are in the form described 
m Note that as new equations are created, we need to reorganise the pointers from the first/last 
words in the equations, however, this is easily done in 0{\) time. The overall cost can be charge to 
the removed X, which makes in total at most 0{^x) cost. □ 

Lemma 19. The running time of OneVarWordEq, except for time used to test the solutions, is 0{n). 



Proof. By Lemma 15 the cost of compression is linear in terms of the size of the succinct representation. 
When the overdue words are excluded, this size is proportional to the total length of long words. Since 
by Lemma |8] this sum of lengths decreases by a constant in each phase, the sum of those costs is linear 
in n. 

Consider the cost related to the overdue word. Note that a word is overdue for only 0(1) phases. 



see Lemma 18 So the compression time for them is only 0(1) per word, which sums to 0(n) where 
all words are included. It was already shown in Lemma [16] that overdue words can be identified in 
time proportional to the sum of lengths of long words plus the number of new overdue words, so this 



cost is 0(n), when summed over all phases. Lastly, Lemma 18 shows that also the time spend on 



removing those words from the equations is 0(n). □ 



4.5. Testing. We already know that storing and compressing the equations can be performed in linear 
time, it still remains to explain how to test the solutions fast, i.e. how to perform TestSimpleSolution 
when all first and last words are still long (note that TestSolution is performed once, so it can take 
time proportional to n). Recall that TestSimpleSolution checks whether S" is a solution by comparing 
S{Ai) and S{Bi) letter by letter, replacing X with on the fly. We say that in such a case a letter 
b in S{Ai) is tested against the corresponding letter in S{Bi). Consider two letters, from Ai and -Bj, 
that are tested against each other. If one of Ai and Bj is long, this can be amortised against the 
length of the long word. The same applies when one of the words ^Ij-i, Aj+i, Bj-i or -Bj+i is long. 
So the only problematic case is when all of those words are short. To deal with this case efficiently 
we distinguish between different test types, in which we exploit different properties of the solutions to 
speed up the tests. 

4.5.1. Test types. Suppose that for a substitution S a letter from Ai is tested against a letter from 
S{XBj) (note that there is some asymmetry regarding words and i3- words in the definition, this is 
a technical detail without an importance). We say that this test is: 

protected: if at least one of Ai, Aj+i, Bi, is long 

failed: if Ai, ylj+i, Bj and -Bj+i are short and a mismathch for S is found till the end of j4j+i 
or Bjj^i; 

aligned: if Ai = Bj and v4j+i = -Bj+i, all of them are short and the first letter of Ai is tested 

against the first letter of Bj; 
misaligned: if all of Ai, ^Ij+i, Bj, Sj+i are short, j4j+i 7^ Ai or Bjj^i ^ Bj and this is not an 

aligned test; 

periodical: if ^j+i = Ai, -Bj+i = Bj, all of them are short and this is not an aligned test. 
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It is easy to show that there are no other tests, see Lemma 20, We separately calculate the cost of 
each type of tests. For failed tests note that they take constant time per phase and we know that 
there are O(logn) phases. For protected tests, we charge the cost of the protected test to the long 
word and only 0(|C|) such tests can be charged to one long word C in a phase. On the other hand, 
each long word is shortened by a constant factor in a phase and so this cost can be charged to those 
removed letters and thus the total cost of those tests (over the whole run of OneVarWordEq) is 0(n). 

In case of the misaligned tests, it can be shown that S in this case is small and that it is tested at 
the latest 0(1) phases after the last of ^j+i, Ai, Bi becomes short, so this cost can be charged 

to, say, Bi becoming short and only 0(1) such tests are charged to this Bi (over the whole run of the 
algorithm). Hence the total time of such tests is 0{n). 

For the aligned tests, consider the consecutive aligned tests, they correspond to comparison of 
AiXAi^i . . . Ai^kX and BjXBj^i . . . Bj^^X, where Ai^£ = Bj^i for i = 1, . . . ,k. Then it can be 
shown that the previous test is either misaligned or protected, so if the cost of all those tests can be 
bounded by 0(1), they can be associated with the succeeding test. Note that instead of performing 
the aligned tests, it is enough to identify the maximal (syntactically) equal substrings of the equation 



and from Lemma 14 it follows that this corresponds to the (syntactical) equality of substrings in the 
original equation. Such an equality can be tested in 0{1) using a suffix array constructed for the input 
equation (and general LCP queries on it). 

For the periodical test suppose that we are to test the equality of (suffix of) S{{AiXY) and (prefix 
of) S{X{BjX)^). If \Ai\ = \Bj\ then the test for Aj+i and -Bj+i is the same as for Ai and Bj and 
so can be skipped. If |ylj| > \Bj\ then the common part of S{{AiXY) and S{X{BjXY) have periods 
|5(yl.jX)| and \S{BjX)\ and consequently has a period \Ai\ — \Bj\ < N. So it is enough to test first 
common \Ai\ — \Bj\ letters and check whether [^(^jX)! and \S{BjX)\ have period \Ai\ — \Bj\. 

This yields that the total time of testing is linear. The details are given in the next subsections. 

Lemma 20. Each test is either failed, protected, misaligned, aligned or periodical. 

Proof. Consider a test of a letter from Ai and S{XBj). If any of Aj+i, -Bj+i, Ai or Bj is long then it 
is protected. So consider the case in which all of them are short, it might be that this test is failed. 
Otherwise, if the first letter of Ai and Bj are tested against each other and Ai = Bj and Aj+i = -Bj+i 
then the test is aligned. Otherwise, if Aj+i 7^ Ai or -Bj+i 7^ Bj then it is misaligned. In the remaining 
case Aij^i = Ai and -Bj+i = -Bj, so this is a periodical test. □ 

Note that when calculating the time spend on testing, we can consider only the one in which at 
least one letter is from a word: the tests in which both letters are from S{X) take only constant time 
per test (as we compare the suffix of a'^ with we can move forward the whole suffix) and so we can 
charge it to the test of a neighbouring word. 

4.5.2. Failed tests. There are 0(1) substitutions tested per phase and so 

Lemma 21. The number of all failed tests is O(logn) over the whole run of OneVarWordEq. 

Proof. As noticed, there are 0(1) substitutions tested per phase. Note that letters from at most two 
short blocks in some Ai and two short blocks in some Bi take part in the failed tests. Since short 
blocks have length at most we conclude that there are 0(1) failed tests per phase and so O(logn) 
failed tests in total, as there are O(logn) phases. □ 

4.5.3. Protected tests. As already claimed, the total number of protected tests is linear in terms of 
length of long words. 

Lemma 22. In one phase the total number of protected tests is proportional to the length of the long 
words. In particular, there are 0{n) such test during the whole run of OneVarWordEq. 

Proof. Suppose that a letter from Ai takes part in the protected test, then one of Aj, Aj+i, Bj^ Bj^i 
is long, we charge the cost of this test to this long word (any of them, if there is a choice). Fix some 
long word Ai'. It can be charged with cost of tests of its own letter (0(|^j/|), as well as protected tests 
of letters from Ai'_i and (when they are short), so 0(1) tests. Furthermore, let the first letter 

of Ai' under the tested substitution is against a letter from Bj' and the last against a letter from Bj". 
Then also the tests from (short among) Bj'_i, Bj', Bj" and Bj"^i can be charged against Ai', but 
again there are only 0(1) of them. So in total Ai' is charged only 0(|Aj/|) in a phase. 
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Figure 3. A letter from Bj is arranged against the letter from Ai. The period of 
S{X) is in grey. 
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Figure 4. The letters of Bj are arranged against the letters from S{X). The lengths 
of fragments of text are beneath the picture, between slashed lines. Comparing the 
positions of the first and second S{X) yields that p is a period, second and third that 
X — p + a. The corresponding borders of S{X) are marked in grey. 



From Lemma [8] the smu of length of long words drops by a constant factor in each phase, and as in 
the input it is at most n, the total sum of the cost of protected test is 0{n). □ 

4.5.4. Misaligned tests. We first generalise the notion of a misaligned test to a more general setting, 
in which S is an arbitrary substitution and not a tested one. We say that Ai and Bj that are blocks 
from two sides of one equations Ai = Bi are misaligned for a substitution S if 

• a mismatch for 5* is found till the end of ^i+i or -Bj+i; 

• all Ai^i, Ai, Bj j^i and Bj are short; 

• they are not aligned, i.e. it does not hold that Ai = Bj and Ai+i = -Bj+i and the first letter 
of Ai is at the same position as the first letter of Bj under substitution 5; 

• the position of the first letter of Ai in S[At) is among the position of S{XBj) in S{Bg). 

Lemma 23. When the Ai and Bj are misaligned for a solution S then S is small. 

Proof. Suppose that Ai and Bj are from an equation Ag = Bg. In the proof we consider only one of 
the symmetric cases, in which Ai begins not later than Bj, the remaining case is shown in the same 
way. 

By definition, each of Aj+i, Ai, -Bj+i and Bj is short and consequently is not the first word in 
Ag = Bg. So consider how S{XAiXAi+iX) and S{XBjXBj+iX) are arranged against each other in 
Ag = Bg. If any letter of Bj is arranged against a letter of Ai and Ai ^ Bj then S{X) is periodic, 
with period smaller than \Ai\ < N, see Fig. [sj If Bj = Ai and their first letters are against each other 
then as Aj+i ^ Bj^i and their first letters are also arranged against each other then we can make the 
same analysis as in the previous case, this time for Aj+i and -Bj+i. In each of those cases S is small. 
The same argument can be applied to the letters of (note that when = Aj+i and their first 
letters are aligned we look at Ai and Bj). 

So, in the following we may assume that letters of Bj (Bj^i) are arranged against the letters from 
S{X). Let a = \Ai\, b = \Bj\ and x = \S{X)\, as in Fig. |4j Consider first the (main) case, in which 
a 7^ 5, we consider the case in which a > b, the other is similar. Let p denote the offset between the 
S{X) preceding Ai and the one proceeding Bj, see Fig. |4j Then S{X) has a borders of length x — p 
and X — p + a — b, see Fig. |4j Then the shorter border (of length x — p) is also a border of the longer 
one (length x — p + a — b), hence the border of length x — p + a — b has a period a — b. Thus the prefix 
of S{X) of length x — p is of the for w^u, where \w\ = a — b and \u\ < a — b. Then the prefix of S{X) 
of length X — p + a (consider the second S{X) on Fig. |4| is of the form w^uAi. But as S{X) has a 
period x — p + a, the whole S{X) is of the form [w^uAiYw' , where w' is a prefix of w^uAi, hence it 
is small. 

So consider now the case, in which = If ^ \Ai\ then \Bj\ ^ |^i+i| and we can 

repeat the same reasoning as above, with Bj and ^j+i taking the roles of Ai and Bj, which shows 
that S is small. So consider the case in which |j4j+i| = lAjj. If 7^ l-^i+il '^^ also obtain that S is 
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small. So we are left with = and \Bj\ = \Bj^i\. Then ^j+i is arranged against the same 

letters in S{X) as Ai and Bj^i is arranged against the same letters in S{X) as Bj. Since Aj+i 7^ Ai 
or -Bj+i 7^ Bj, some of those tests fail, contradicting the assumption that is a solution. □ 

Lemma 24. There are 0{n) misaligned test during the whole run of OneVarWordEq. 

Proof. We first show that if Ai and Bj are misaligned for S then they were (for a corresponding 
solution) already when the last among A {i + l)-word, A i-word, B {j + l)-word and B j-word became 
short. Note that it is enough to show the claim for the previous phase end then apply the induction. 

Claim 4. Suppose that Ai and Bj are misaligned for S. If at a previous phase all ^i+i, A[, B'j^^ and 
Bj were short then A'^ and Bj were misaligned for the corresponding S'. 



Proof First note that by Lemma 14 it holds that A'^ 7^ ^i+i or Bj 7^ ^'j+i- additionally all A[_^i, 
A[, B'j^^ and Bj are short, then the only reason why A[ and Bj could not be misaligned for S' is that 
A'^ = Bj, A[^^ = B'j^^ and under 5' the first letters of those A'^ and Bj are arranged against each 
other. Note that left-popping and right popping letters from X does not change those equalities, nor 
the fact that the first letters of those words are arranged against each other. The same applies to pair 
compression and block compression. Hence, also Ai = Bj, Ai-^^i = -Bj+i and the first letters of Ai and 
Bj are arranged against each other, contradiction. □ 

Using Claim |4] we can give a bound on when a word can take part in a misaligned test for a solution 
S (not that this not necessarily apply to all substitutions). 

Claim 5. Let the i-th phase be the one in which the last of A^_^i, A'^, B'j^-^ and Bj became short. 
Then there is no misaligned test of letter from Ai against a letter from S{XBj) for a solution S in 
phase £ + c or later, for some constant c. 



Proof. Suppose that Ai and Bj have a misaligned test for a solution S. By inductive argument on 
Claim El it follows that A!^ and B'j are misaligned for the corresponding 5". By Lemma 23 it follows 
that S is small and so it is tested at the latest in phase I + c (for some constant c) . □ 

So whenever (in phase (.') a letter from Ai has a misaligned test against a letter from S{XBj) we 
can check, in which turn i the last among words Ai, Aj+i, Bj -Bj+i became small. If ^ + c < ^' then 
we can terminate the test, as we know already that S is not a solution. This allows an estimation 
on the number of misaligned tests. Firstly, there are at most ©(logn) of tests that terminate the 
whole testing, the argument is similar as in the proof of Lemma [21} Otherwise, the cost of the test 
(of a letter from Ai tested against S{XBj)) is charged to the last one among Ai, ^j+i, Bj, -Bj+i that 
became short. For a fixed substitution, a fixed word can be charged only from a constant number 
of short words, so 0(1) cost in total. There are only 0{\) substitutions per phase and moreover the 
word is charged only for c phases after it became short, so in total there is 0{1) charges to this word. 
Summing over all words in the instance yields the claim of the lemma. □ 

4.5.5. Aligned tests. Suppose that we test the first letter of some Ai against the first letter in Bj 
(both of those words are short), where 7^ ^j-i or one of them was long (i.e. the tests for 

previous words were not aligned). We can preprocess (in 0{n) time) the input equation (building 
a suffix array equipped with a structure answering general LCP queries) so that in 0(1) time it 
returns the smallest k > {) such that Aij^k 7^ ^j+fc- In this way we perform all equality tests for 
AiXAi+iX . . . Ai+k-iX = BjXBj+iX . . . XBj+k-iX in 0(1) time. 

Lemma 25. In 0{n) we can build a data structure such that given two Ai and Bj it in 0(1) time 
returns the smallest k >0 such that Ai^k 7^ Bjj^k- 

Proof. Let A[, B'j etc. denote the corresponding original words of the input equation. Observe that 



by Lemma 14 A'^j^^ = Bj^^ if and only if Ai^i = Bj^i. Hence, it is enough to be able to answer such 
queries for uie input equation. 

To this end we build a suffix array [2] for the input equation, i.e. for A1XA2X . . . Anj^XBiXB2X . . . Bn^^i 
Now, the Icp query for suffixes Ai . . .$ and Bj . . .$ returns the length of the longest common prefix. 
What we want is the information, is the number of words in the common prefix, which corresponds 
to the number of Xs in this common substring. And this information can be easily preprocessed and 
stored in the suffix array. For instance, we can for each position i in A1XA2X . . . An^X BiX B2X . . . Bn^ $ 
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store, how many Xs are before it in the string and store this in the table prefX. Then when for a suf- 
fixes starting a position p we get that its common prefix is of length k, we return prefX\p+k] —prefX\p] , 
which is the number of Xs in the common prefix in such a case. □ 

Using this data structure performing all aligned tests is easy: whenever we test the first letter of Ai 
against the first letter of some Bj, we use this structure, obtain k and jump to the test of the first letter 
of Ai^k with the first letter of Bj^k. If = then this is not the aligned test and we add the cost of 
using the data structure to the cost of the test. If A; > then ^j+fc 7^ ^j+fc and Ai^^-k-i = ^j+fc-i- Thus 
the tests for letters of Ai^k-i are either protected, misaligned or failed (they cannot be periodical, 
as Ai+k-i = Bj+k-i and Ai+k Bj+k implies that Ai^k A+k-i or Bj+k-i -Bj+fe)- So we can 
associate the cost of those aligned tests with the cost of a protected, misaligned or failed test, and so 
the time spend on them is at most linear over the whole run of OneVarWordEq. 

Lemma 26. The total cost aligned test and the usage of the suffix array is 0{n). 

Proof. We formalise the discussion above. When the first letter of Ai and Bj should be tested against 
each other and A^ 7^ Bj then this is not an aligned test (the cost of checking whether Ai = Bj is then 
added to the cost of the following letter-test). Otherwise this is an aligned test, we add the cost of 
checking whether Ai = Bj to the cost of the aligned test. 

In 0(1) wc get the smallest k such that Ai^k 7^ ^j+k or one of them is in the other equation. We 
then jump straight to the test of Ai^k or -Bj+fc (those are different tests if one of them is a first word 
in an equation). Consider Ai^^k-i and Bj+k-i- There are the following cases: 

one of Ai^k-i or Bj^k-i is a last word: which shows that it is long, contradiction 
Ai_^_k-iX or Bjj^k-ix ends one side of the equation: Then the other side is ended differ- 
ently (they cannot both end in X) and so the test for one of Ai^k-i or Bj^k-i is failed 
both Ai^k-i and Bj_^k-i are within the same equation: we show that in this case the test 
for the first letters of Ai^^k-i and Bj^k-i is protected or failed or misaligned. Firstly, since 
Ai+k-i = Bj+k-i and vlj+fc ^ Bj^k then Aj+fe-i 7^ yl^+fc or Bj+k-i / -Bj+fc. Hence the test 
for the first letters of and Bj^k-i cannot be aligned nor periodical. So it is misaligned, 

protected or failed. 

Hence we can associate the 0(1) cost of all aligned tests for AiX . . . Ai^k-i^ and BjX . . . 

with cither a failed, protected or misaligned test. Thus in total they take 0{n) time. □ 

4.5.6. Periodical tests. In order to identify the periodical tests we keep for each short Ai the value 
k such that Ai_^_k is the first word that is different from Ai or in another equation, those are easy to 
calculate at the beginning of each phase. Now when a periodical test is done, i.e. we test letters from 
a suffix of S{{AX)'') against the letters from (prefix of) S((BXY) then we use that S{BXY has a 
period \S{BX)\, hence it is enough to perform this test manually for S{{AXY) and S{{BXY) and 
then check whether S{{AXY) has a period \S{BX)\. All of this can be done in time 0{\A\ + \B\), so 
constant. Furthermore, the next test is either protected, misaligned or failed, which yields that the 
number of periodical tests is 0{n). 

Lemma 27. Performing all periodical tests and the required preprocessing takes in total 0{n) time 

Proof. Wc first show how to calculate for each short A^ (and Bj) the k such that ^i+fe is the first word 
that is dift'erent from Ai or it is in another equation. 

At the end of the phase we can list all words Ai that become short in this phase, ordered from the 
left to the right. Note that this takes at most the time proportional to the length of all long words 
from the beginning of the phase, so 0{n) in total. Consider any Ai on this list, note that 

• if Ai^i 7^ Ai or it is in the next equation, then Ai should store k = 1; 

• ii Ai = v4j_|_i and they are in the same equation then Ai^i also became short in this phase and 

so it is on the list and consequently Ai should store 1 more than Ai-^i. 
So we read the list from the right to the left, let Ai be an element on this list. Using the above 
condition, we can establish in constant time the value stored by Ai. This operation is performed once 
per block, so in total takes 0{n) time. 

So suppose that during the tests, when the first letter of Ai is tested against the letter of S{XBj), the 
kA and ks stored by Ai and Bj are both greater than 1. Consider the one of Aij^kA Bj^ks which 
ends first under substitution S (this can be determined in 0(1) by simply comparing the lengths), 
without loss of generality let it be the latter. There are the following cases: 
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Figure 5. The case of a > 6. The part of S{{XAi)'^) that has a period a + x and b + x 
is in grey. 

Bjj^kg 7^ Bjj^kB-i- Then the test for Bj^kg-i is not periodical. Furthermore, it cannot be 
ahgned: if it were, then Bj^ks-i is ahgned against Ai^k for some k < kA and so Bj+kB-i = 
Ai^k imphes that Ai = Bj and they are ahgned, contradiction. So it is either misahgned, 
protected or failed. 

Bj^kB-^ ends the equation: in this case either the other side of the equation does not end at 
this point, so the test for Bjj^kg_i is failed, or Aj+fc for A: < fcyi is the last word, so the test for 
the letters in Aj+fc^ are protected. 

Bj^^B 1=^^^ word: then the test for Bj^i^-g^i is failed, as we consider the case in which 

under the substitution S the Ai^/^^ ends later than Bj^i^^b- 

So either the test for -Bj+fc^ is either protected, failed , aligned or misaligned or the test for Aj+fc^ 
is protected. Since there are in total 0{n) of such tests it is enough to show that the test for the 
common part of S{AiX ■ ■ ■ XAi^^^^iX) and S{BjX ■ ■ ■ XBj^i^-g^iX) can be performed in 0(1) time, 
which is associated with the cost of the following test. 

Let a = \Ai\, b = \Bj\ and x = \S{X)\. First consider the simpler case in which a = b. Let 
k = mm(kA, ks)- Then the tests for vlj+i, . . . , Ai^k_i are identical as for Ai, and so it is enough to 
perform just the test for Ai and then jump right to ^j+fc. 

So let us now consider the case in which a > b and that the first letter of Ai under the substitution 
S begins earlier than the first letter of Bj, the other are done in the same way. Observe that when 
the whole S{{BjXY) is within S{{AiX)'^) then this can be tested in constant time in a naive way: the 
length of S{{AiX^)) is 3(a + x) while the length of S{BjXY is £{b + x). Since a/b is at most N we 
obtain that i < 3N and so the whole test can be done in 0(1). 

So consider the remaining case, see Fig. |5]for an illustration. First S{XAiXAi) has period x + a. 
However, it is covered with S{{BjXY), so it also has period x + b. Since x + a + x + b < 2x + 2a, 
it follows that also the p = gcd(j; + a,x + b) < a — 6 is a period of S{XAiXAi) and consequently, of 
S{AiX) and S{BjX). So, to perform the test for the common part of S{AiX ■ ■ ■ XAi^kA-i-^) 
S{BjX ■ ■ ■ XB.jj^kB-i^) it is enough to calculate p, test whether S{AiX), S{BjX) have period p and 
then compare their first common p letters. All of this can be done in 0(1), since p < a — b < N (note 
also that calculating p can be done in 0(1), as gcd(x + a, x + b) = gcd(a — b,x + b) and a — b<N). □ 

Open problems. Is it possible to remove the usage of range minimum queries from the algorithm 
without increasing the running time? Can the recompression approach be used to speed up the 
algorithms for the two variable word equations? 
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